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Abstract 

The paper presents an analysis performed over the worldwide top 150 firms in the 
pharmaceutical industry. It begins with a test of the Gibrat's Law of Proportionate Effect 
finding, in line with previous literature, a violation concerning the variance of the growth. 
Then it shows, using disaggregated data on sub-markets (defined according to the ATC 
code) that this violation can be completely referred to the existence of a "diversification" 
effect, namely a scale relation between the number of active sub-markets a firm posses 
and its size. The observed scaling property of the firms diversification patterns is in 
contrast with the linear assumption typically made in literature. Finally, to interpret 
the findings, the work proposes a stochastic model for firms diversification which fits 
quite well the data. 



1 Introduction 

In industrial economics the "stochastic" approach to the description of firms behavior has been 
introduced long ago with the advent of the Gibrat's "random walk" description of the business 
firm growth process ( pibrat (1931|) ; for an historical review see [Sutton (1997| )). Although this 



kind of analysis has been applied to the study of the time evolution of firms "size" (measured 
by diverse variables Uke sales or employees) with a certain degree of success, the persistent 
lack of data has prevented its application to other firm-specific relevant economic variables (as 
productivity, profitability or diversification, to cite a few). 

Moreover, the assumption of "identity" between the "objects" (firms) that form the sys- 
tem (industry or sector) implicit in the "statistical" description, while perfectly natural in 
contexts like biology, ecology or physics, where the "stochastic" models have found a large 
domain of application, seemed to many authors dissatisfying if applied to the description of 
economic phenomena. In fact, a large mainstream literature seems going exactly in the op- 
posite direction, trying to characterize industry dynamics starting from a set of firm-specific 
characteristics (both observable, as the firm's age [Mueller (1972 ) or unobservable, as its "state" 



inside a "natural" growth cycle [Greiner (1972[) ). These models, contrary to the "stochastic" 
ones, typically concentrate on variables, like productivity or profitability, considered more 
relevant to describe the actual firm "performance". It must be noted, however, that even if 
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these studies can be used to obtain "interpretative guidelines" (see for instance |Dosi et aL 



'll p95|) ) they would need, in order to be (dis)proved or at least completely exploited in their 
descriptive power, datasets far more complete that the ones available today. 

The present contribution can be considered halfway between the two previous approaches. 
Indeed, while it is based on "stochastic modeling" which, with its high degree of "essentiality" , 
has proved to be a powerful tool for an empirical based analysis of firms size dynamics, it 
extend such technique to an other, in same sense more "strategic", aspect of firms behavior 
constituted by their diversification patterns. In this sense it constitutes a first attempt to 
include, via an empirical grounded analysis, relevant firm-specific economic variables in the 
description of firm growth performances. 

The baseline model in stochastic firms growth dates back to the pioneering work of Gibrat 
who proposed a model, known as "Law of Proportionate Effect" (LPE), relating the size of a 
firm and its rate of growth by the expression 

5(t + 1) = (1 + (1) 

where S{t) is the firm's size at time t and i? is a random variable not dependent on S. This 
process reduces to a random walk in the log of size s(t) = log(S'(t)) and (under the usual 
assumption of validity of the Central Limit Theorem) predict an asymptotic log - normal 
size distribution. This "crude" model misses many important aspects of industrial dynamics 
(firms entry, exit, merging, etc.) and various refinements have been proposed (as an example 
see [Hart (196^ ) , ^imon and Bonini (1958|) , [Ijiri and Simon TT977D , gutton (19981) and recently 



Ljcrosky (1999 )) but it provides a robust framework for the interpretation of data. From its 
introduction the LPE has been empirically tested by many authors on different and heteroge- 
neous datasets (for a recent review see [Sutton (1997| )) generally obtaining two, in some sense 
opposite, conclusions: the Gibrat hypothesis is confirmed in first approximation by the lack 
of any relationship between the (log) size of firms and their rate of growth but is violated by 



a clear dependence of the growth variance on size (see, for instance Evans (1987 ); Hall (1987 ) 



Mansfield (19621) and more recently [Stanley et al. (19971) ; |Lee et al. (199^ )). 



A natural explanation for the reduction of growth variance with size could be a sort of 
"portfolio" effect (see eg. [Hymer and Pashigian (1962[ )). The basic idea is that a firm can 
be described as a collection of "atomic" components (line of productions, plants, etc.) of 
roughly the same size whose number would be proportional to the size of the firm. Under 
the assumption that the growth processes for different components are independent, the Law 
of Large Number (LLN) would predict a linear relationship between the firm size and the 
variance of its growth rate. The observed dependence is however milder and the failure of the 
LLN is usually inputed (see e.g. Poeri (1989[) ) to the existence of a "relation" between the 



firm components that makes the aggregation of the "atomic" growths not simply additiveQ A 
model has recently been proposed [Stanley et al. (1997[ ); [Lee et al. (1998[) , based on a supposed 
intra-firm complex hierarchical structure, which, opportunely tuned, reproduce quite well the 
observed behavior. On the contrary, I will show in what follows that, if one correctly identifies 
the "atomic" contributions to growth and the scaling relation between their number and the 
firm size, the LLN does a good job in explaining the observed Gibrat violation, at least as far 
as the database under analysis is concerned, without any need of intra-firm "structure" . 

The analysis presented in the following is based on the dataset PHID (Pharmaceutical 
Industry Database )0 that covers the top 100 firms relative to the seven major western markets 

^The sole introduction of correlation in growth components is clearly not enough. 
^Developed at the University of Siena 



2 



2 
1.5 
1 

0.5 


-0.5 
-1 
-1.5 



<h O O O ^ 



o o o o <> 



H ^ ^ ^ h 



H ^ h 



[t B B H- 



-e B B H g B E] 



Standard dev. — <^ 
Skewncss — h- 
kurtosis — B- 



10 



Figure 1 : Moments of the g distribution computed at different times 



(USA, United Kingdom, France, Germany, Spain, Italy and Canada) in tlie period ranging 
from 1987 to 1997. The dataset provides the firms sales in USD, disaggregated up to the 
4-digit-level of the Anatomical Therapeutic Classification scheme (ATC) in 517 micro classes. 

The firms under analysis are obtained aggregating the regional figures and merging from 
the beginning the sales pertaining to firms that belong to the same entity at the end of period. 
Moreover, I consider only the 150 so obtained firms with the top sales at the beginning of the 
period^. 

In the next section I present a brief overview of the statistical properties of the size dis- 
tribution and of the growth process. A wider analysis is presented in Bottazzi et al. (2000 ), 
here I concentrate the attention on the violation of the Gibrat model emerging as a relation 
between the variance of growth and the size of a firm. In Section |^, I perform a deeper 
analysis of this relation, using disaggregate data on the "sub-markets" defined by the 4-digit 
ATC code. I am then able to reduce the Gibrat violation to a "diversification" effect, due to 
a scale relationship between the firm size and the number of its active markets. Finally in 
Section ^, I propose a model, inspired by the structure of the "technological evolution" in the 
pharmaceutical industry, which accounts for the observed pattern of firm's diversification. 



2 Data analysis 

Let Si{t) be the size of firm i {i G [1, . . . , A^]) at time t (z e [0, . . . ,T]) where (see Sec. |l|) 
A^ = 150 and T = 10 and let define the 'normalized sizes" as Gi{t) = Si{t)/ < Si{t) >. This 
variable and its log gi{t) = \og{Gi{t)) are characterized by distribution functions that can 
be considered stationary in time. The reliability of this assumption can be checked plotting 

■^This cut is imposed to minimize the possibility of selection bias due to the different "relative" sizes of 
national markets. It does however introduce the interpretative problem of studying the evolution of a subset 
of firms that, at later time, are in general sparse in the ranked set. I have checked that this effect is almost 
negligible, due to the low rank mobility of the industry, yet it is present and will shape the lower tail of the 
size distribution. 
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Figure 2: Distribution function for the firm "normalized size" g. The three curves correspond 
to the distribution obtained for t = 0, 5, 10. 



the moments of the distribution as a function of time (see Fig. |l|): a part from a constant 
mild increase in the variance of the distribution, the approximation turns out to be good. 
However the major contribution to the variance increase comes clearly from the low g region 
( see Fig. ||) where a "spurious" (due to the selection bias) diffusion toward smaller sizes is 
present. This diffusion is responsible for the observed increase in variance. Once understood, 
this effect is statistically irrelevant and in the following analysis the g variable is thought 
identically distributed at every time step.0 

As a first step in the search for a violation of the LPE it is natural to investigate the 
possible presence of a relation between the first moments of the growth distribution and the 
size. One can proceed in a straightforward way: said hi{t) = gi{t + 1) — gi{t) the growth of 
firm i at time t, one bins all the firm according to their size gi{t) in equally numbered sets, 
and computes the mean, standard deviation and 1-lag autocorrelation of the growth hi{t) for 
each set separately. The results are reported in Fig. ^ against the average size of each bin. 

Both the mean growth and the autocorrelation do not show any particular pattern but a 
clear dependence of the growth variance on size emerges and The Law of Proportionate Effect 
is violated. Fitting the relation between variance of growth and size with an exponential law 

aih) ~ e'^s (2) 

one find a value (3 = .2 H — .03 that is striking similar to the one found in other analysis on 
different datasets Stanley et al. (1997| ) |Lee et al. (1998|) and much lower then the "portfolio" 



prediction = .5 discussed in Sec. ||. 

In what follows I will show that the LLN is, however, sufficient to completely characterize 
the relation in (^. The key point will be to look at the whole firm growth as an "aggregation" 
of its growth in the different sub-markets in which it operates. 

^By the way, note that the transition from the S to the G variables, makes the discussion insensitive to any 
prices variation that is constant over all the pharmaceutical products (for instance, the monetary inflation or 
any price dynamics relatives to pharmaceutical industry). 
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Figure 3: Mean, standard deviation and autocorrelation of growth h computed for different 
size bins plotted against the average size in the bin. The exponential fit to the standard 
deviation gives a value 7 = —0.20-1- / — 0.03 



3 Diversification as a source of Gibrat violation 

One of the peculiar feature of the PHID database is the possibility of disaggregate the sales 
figures until the 4th digit of the ATC code. This level of disaggregation allows to identify 
sub-markets that are "specific" enough to be considered, roughly speaking, the loci of compe- 
tition between firms: the products belonging to a given sub-market posses similar therapeutic 
characteristics and can then be considered substitutable while products belonging to different 
sub-markets are usually targeted to different pathologies. Moreover, the data on licensing 
agreements [Orsenigo et al. (2000|) show that this disaggregation level is also the one at which 



single research projects develop. These characteristics lead to the conclusion that the dif- 
ferent "therapeutic sub-markets" provide the natural "scale" at which the "firm's growth" 
phenomenon must be studied. 

The analysis in this section is limited to the variance-size relation characterizing the growth 
process of firmsQ It turns out that the Law of Large Number is in fact responsible for this effect 
if one assumes as "atomic" contribution to the firm's growth the growth of firm's sales in the 
different sub- markets. This result comes from two non-trivial observations: first, correlation 
across sub-market is negligible and second, the number of active sub-markets of a given firm 
is on average increasing with its size. 

To make the argument clearer, let me introduce a bit of notation. Let Sij{t) be the size of 
firm i in sub-market j at time t. The aggregate size of the i-th firm is the sum of its size on all 
the sub-market Si(t) = Sij{t) and the aggregate growth defined in (|l|) can be rewritten as 

If one computes the correlation of the ratios -|- 1)/Si{t) for all the firms in all the 



^For other aspect of the sales dynamics in sub-markets see [Bottazzi et al. (2000 ) 
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Figure 4: Results of various statistics computed on groups of firms binned according to their 
size. On the x-axis is the log of the average size of the bin. (a) Mean and (b) variance of Rij{t) 
for different sub-markets j. (c) Mean and (d) variance of Aj j(t) for different sub-markets j. 



sub- markets, i.e. among all the possible couples of indeces i and j, one obtains a distribution 
centered around zeroQ. The growth processes on different sub-markets can be considered to any 
extent uncorrelated and the variance of the aggregate growth is obtained adding the variance 
of the growth in each sub-market. 

In order to keep the present analysis consistent with the results of previous Sections it is 
necessary to switch to the "rescaled size" Gi{t). This can be done straightforwardly but for 
clarity purposes let me introduce the new variables Ri,j{t) = Sij{t + 1)/Sij{t) and Aij{t) = 
Ni{t) Sij{t)/ Si{t) where Ni{t) is the number of sub-markets in which firm i operates at time t 
(active sub-markets). Then using the variance of the "normalized growth" can be written 
as 



yaTi^t[Hi{t)] = vaii^t 



cm 



varii 



M{t) 
[M(t + l) 



J 



(4) 



where M{t) is the average (aggregate) size of firms at time t and the ratio M{t)/M{t + 1) is 
a normalization factor (proportional to the rate of growth of the total industry). I used the 
short notation vaii^t to denote the variance of the distribution obtained, consistently to what 
done in previous Sections, using the complete panel (all the firms at all the time steps). 

In (1) the contribution of each sub-market factorizes in three terms: Rij{t), which is the 
actual growth of the firm i in sub-market j; the inverse number of active markets l/iVj(t) 
and Aij{t), which is a weighting coefficient describing the "diversification" heterogeneity of a 
firm: if the firm i at time t is symmetrically diversified over its sub-markets, the Ai,j{t) (for 
different j) are concentrated around 1, otherwise if the firm is asymmetrically diversified, the 

^With a standard deviation of 0.000388 and an average deviation of 0.000024 
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Figure 5: Average number of active sub- markets. The fit is a straight line with slope a 
.39 + -.02 and intercept g = 6.0 + -0.12 



Aj j(t) are broadly distributed. The mean and variance of the distributions for Rij{t) and 
'^ijit) obtained using different bins in the aggregate size do not show any clear dependence 
on the average size of the bin (see Fig. Then the term iVj(t), the number of active sub- 
market a firm posses, must be the sole responsible for the observed dependence of variance 
over the aggregate size. Indeed, fitting on a log-log scale the average number < Ni(t) > of 
active markets for each bin against the average size of the bin (see Fig. |^) one obtains a slope 
a = .39 H — .02 and an intercept q = 6 -\ — 0.12. The Law of Large Number would predict a 
relation between the exponent in (^) and the slope in Fig. ^ of the form f3 = —a/2 which is 
in perfect agreement with the dataQ. 

The conclusion is that the relation provided by the Law of Large Number is valid as long as 
one consider the actual number of sub-markets a firm operate in. It must be stress, however, 
that in order to demonstrate this statement, it has been necessary to rule out two possible 
sources of functional dependence between a firm's size and the variance of its aggregate growth: 
the possibility that the variance or the mean of a firm growth in a given sub-market depend (on 
average) on its total size and the possibility that the diversification patter of a firm (described 
by the variable Aj) varies (on average) with its size. Both these possibilities are actually 



discussed in literature [Hart and Prais (1956|) and proposed as possible source of Gibrat Law 
violation. 

It remains however to explain why the number of active markets a firm possesses, shows a 
so clear dependence on its size N{G) ~ and what is the meaning (if any) of the parameter 
A. In the next Section I propose a model for the diversification process of business firms, 
essentially based on the assumption of a "self-reinforcing" effect concerning the "proliferation" 
of sub-markets a firm operates in, which provide a simple but suggestive interpretation of 
Fig. |. 



^Notice that a weak relation appears between the variance of Aij and the total size. A linear fit provide a 
slope .09 that is negligible if compared to the effect due to the number of active sub-markets. 
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4 A stochastic branching model for firm diversification 



The previous analysis shows that larger firms are present, on average, on a greater number 
of sub-markets. For the purposes of this Section it is useful to read this statement in a 
"dynamical" way: as firms grow their activities becomes more and more diversified as they 
sell products on different sub-markets. 

In what follows I will try to capture the diversification "behavior" of firms with a model 
describing how the number of active markets (or better, the probability of having a given 
number of active markets) changes as firm grows. Notice that I'm not interested in a "dynam- 
ical" process in time, i.e. a model describing the evolution of diversification structure "as time 
goes by" . This is because the growth dynamics of the same firm could be very heterogenous 
if observed at different time steps (actually, in this sector, it is, see. [Bottazzi et al. (2000| )) and 
it would be difficult to maintain that shrinking firms behave, with respect to diversification, 
as growing ones. Rather, I will treat some "size" variable as independent and describe the 
change in diversification patterns as this quantity is varied. Due to the multiplicative nature 
of the growth, the natural candidate to play the usual role of "time" is the log of firms size g. 
The object of the following analysis thus becomes the probability that a firm which possesses 
n active markets when its (log) size is g, will possess m active markets when its (log) size is 
g' > g, P{n,g;m,g'). 

Before proceeding with the model building, let me briefly mention a collection of mod- 
els that can be considered a previous attempt to merge both the "diversification" and the 
"growth" aspect of firm's dynamics. These models, collectively referred as "island models", 
was originally proposed by Simon (see [Ijiri and Simon (1977|) ), and can be considered a "clas- 
sic" in the stochastic growth theory of firmsQ There the "diversification" dynamics is not 
directly analyzed, but rather serves as a "driving" process that "sets the pace" for the firms 
growth process, described as a successive capture of diverse "islands", or "business opportu- 
nities". In this way the distinction between "growth" and "diversification" becomes rather 
vague, but nevertheless two assumptions seem to be generally accepted concerning the latter. 
The first is on the nature of the "diversification" events (here the entry on either a previously 
unexplored or unexploited market): according to it, these events are seen as "shocks" (in- 
vestment opportunities, technological achievements, etc.) that can "happen" to firms along 
their histories with mutual independence. The second is to consider these "shocks" uncorre- 
lated in time. i.e. to assume that they could happen with the same constant probability in 
any instant, irrespectively to the actual firm "state". If one neglects the possibility of "in- 
stantaneous" multiple shocks^, the previous assumptions provide a complete definition of the 
transition probability P{n, g;m,g + 6) between two sizes differing by an infinitesimal quantity 

where A is the constant "shock rate" (the average number of "diversification events" for unit 
time); this form of the transition probability defines the well known Poisson process. A 
pictorial illustration of this process is shown in Fig. ^: the "blobs" stand for the diversification 
event "black boxes" that a firm meets along its growth. As far as this model is concerned, it 



^For their generality and robustness they have attracted attention also recently, for instance Sutton (1998 ) 
is mainly based on this kind of models. 

^Equivalently: requires the property of orderliness for the associated stochastic process. For a general 
discussion on the complete characterization of point processes see Snyder (1975) 



8 



Figure 6: Diversification by independent 
events. The interarrival "times" Sj — are 
exponentially distributed. 



Figure 7: Diversification by branching. The 
interarrival "times" are no longer exponen- 
tial, the distance between two events de- 
creases as the number of active markets in- 
creases. 



is irrelevant what really happens inside the back box, what matters is that after the "shock" 
the firm ends with one more active sub- market. 

The Poisson process would however predict a linear increase of the average number of 
active sub-markets with g, a property that is clearly in contrast with the empirical results 
discussed in the previous Sections^. In order to obtain a more satisfying model, it proves to 
be useful to explore inside the "black boxes" of Fig |], and try to describe, at least partially, 
the actual nature of the diversification events. A possible interpretation of these events comes 
from the knowledge about the "technological" behavior of the industry Prsenigo et al. (20001) . 
The growth in pharmaceutical industry is highly "research driven" so that it is plausible to 
think to the diversification too as an outcome of "technical improvements" : a firm enters a new 
market when it has the (technical) capability of developing products for this market. Moreover 
previous studies over a wide set of patent agreements [Prsenigo et al. (2000| ) showed that, for 



what concerns this sector, the technical knowledge proceeds with successive refinements with 
new results bringing in new possibilities of technological advancements. Then, the simplest 
way of mapping the resulting "technological behavior" on the diversification dynamics is to 
suppose that the diversification process proceed as a branching process, where each opened 
branch (sub-market) becomes eventually the origin of a new branching (diversification) event. 
A picture of this process is shown in Fig. |^ under the (minimal) assumption that the branching 
is uniformly binary. If one neglects the topology in sub-markets space that emerges from these 
successive branchings, but is interested only in the actual number of these sub-markets, the 
process in Fig. ^ can be readily described: all the active sub-markets can be sources of a 
possible "diversification" event "a la" Poison, and the (^ must be modified to read 

/'(n..;™.«+.)^{:,^f-°<^) (6) 

where again multiple instantaneous branchings are neglected. If Pn{g) is the probability that 

-•^^Notice that one could also ac count for the observed behavior using a Poisson process with a non linear 
intensity function t(G) = (see Snyder (1975| )). This approach seems however rather unnatural and would 



provide a worse description for the overall statistics, see Fig.g 

Roughly speaking, this means that a relative large amount of firms budget is invested in formal R&D 
activities and these activities generate, via products innovations, increasing competitiveness for the firms 
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a firm of size g has n active markets it must satisfy the (pure-birth) set of equations 



Pn{g + S) = Pn-iig)Pin-l,g;n,g + 6) -pn{g)P{n,g;n,g + 6) n > Uq 
Pnoig + S) = -Pmig) P{no,g;no,g + 6) n = no 

where no is the initial number of active markets. Substituting in (0) the definition given in 
(^ and taking the hmit 6—^0 one obtains the set of differential equations: 

Pnig) = -nXpn{g) + {n-l)Xpn-l{g) U > Uo 

P'noig) = -noXpnoig) n = no 

with the initial conditions: 

where go is the initial size of the firm. The previous process is known as the Yule process 
and has been originally proposed to explain the proliferation in time of animal species (for a 
discussion see [Feller (1968|) and reference therein). The system (|D can be easily solved (see 
Appendix 1) to obtain the following distribution: 

pM = f ~ M e-"°^(^-^») (1 - e-^(^~^°))"""'' , (10) 
\n - no J 

defined for g > go and n > uq. 

Let me turn now to the problem of describing the observed data with the proposed model. 



The distribution in (10) contains three parameters: the "diversification" rate A, the initial 
number of active sub- markets uq and the initial firm size Sq. Since it correctly predicts an 
exponential increase in the average number of active markets with size 

< N{g) >= nPn{g) = noe^(^'-^°) , (11) 

n=no 

one can use the linear fit shown in Fig. ^ to obtain an estimate of the parameters in ([TT|), 
requiring the fulfillment of the two conditions X = a = .39 and log(no) — Xsq = q = —6. 

The descriptive power of the model, however, must be judge using more distinguishing 
object than the average < N{s) >. A good candidate is the diversification pattern of the 
whole industry. To be precise, let P(Mo < M < Mi) be the probability that a firm posses 
a number of active market between Mq and Mi. This quantity can be evaluated using the 
actual frequency of occurrences: 

Ah N T-1 
m=Mo 1=1 t=0 

where ni{t) is the number of active market of firm i at time t and 6 is the Kronecker delta. 
There is, however, an alternative approach for the computation of the previous quantity: it 
can be obtained via a simple convolution 

P(Mo<M<Mi)= J2 / dgfig)prnig) (13) 

m=Mo -^30 



10 



0.45 



0.4 



observed 
Yule 
Poisson 



0.35 - 
0.3 - : 
0.25 - 1 



0.2 - 



15 

0.1 - j : j : j : _ 

0.05 - j = j = j = j = I 

U i ^ I - ^ I \ S_Ej \ S_Ej \ S_5 \ S_E \ S_c I SlE \ H_E C - 1 

20 40 60 80 100 120 140 160 180 200 220 



Figure 8: The binned probability density for the number of sub-market computed directly by 
the data and theoretically predicted by the Yule process. The fit showed was obtained with 
A = .39, uq = 5 and Sq = —12. As comparison, a fit of a Poisson model with a non linear 
intensity function t{G) = is also shown. 



once the size probability density f{s) and the "diversification" probability Pn{g) are known. 
An estimate of (0) can be obtained using the observed frequencies in data 

Ml N T-1 
m=Mo i=l t=0 

where gi{t) is the size of firm i at time t. Notice that this second method depends on the 
"branching" model to obtain an estimate of the probabilities P{Mo < M < Mi). A comparison 
between the observed P°'' and the predicted P^^ distributions provides a way to tune the 
residual degree of freedom, and an indication of the goodness of the model. As showed in 
Fig. 1^ the accordance, considering the extreme simplifications introduced, is good. 



5 Conclusions and outlook 

The relation between the variance of growth and the size, which constitutes the clearest and 
often reported violation of the Gibrat law has been reduced to a diversification effect: bigger 
firms operate in more sub-markets and the variance of their growth is consequently reduced^- 
I have shown that, in the industry under analysis, other possible effects (in particular the 

^^Incidentally, the observed relation between the number of sub-markets a firm posses and its size constitutes 
an original example of a "scaling" relation, and has to be added to other more famous example pertaining the 



domain of economics (for a review and critical discussion see [Brock (1999| )) 



11 



correlation among sub-markets and the dependence of the "strategic" diversification pattern 
of a firm on its size) if exist, are neghgible. 

The actual structure of the firm diversification can be described with good accuracy using 
the simple stochastic process proposed in Sec. |[ A caveat must be introduced, however, 
regarding the "interpretation" of this model: the strong technological component in industry 
growth suggests, but does not imply, that the main driving force in diversification would be 
some sort of "technological specification". The "penetration" of a firm in a new sub-market 
can actually possess various natures, nevertheless the proposed model is likely to describe the 
diversification djTiamics irrespectively of this variety. The reasons of this fiexibility are in the 
simplicity of the hypotheses. The model is indeed based on two general assumptions: first, 
the existence of some sort of "competencies" providing the firm with the ability to diversify its 
business, and second, that these "competencies" (and so the ability to enter a new sub-market) 
increase with the number of times they have been effectively used (and so with the number of 
opened sub-markets)|^ Both these assumption seem, in most cases, so reasonable that would 
be difficult to find arguments against them. 

The previous model, thus, suggests an "evolutionary" description of firms which progres- 
sively "learn" to diversify, whatever the object of this learning would be. Incidentally, the 
observed relationship between the growth variance and the size of the firms, being milder 
than the LLN prediction, gives evidence against the interpretation of "diversification" as a 
risk minimization strategy: indeed if this would be the case, firms have to be present on much 
more sub-markets then they actually are. 

The present analysis can be extended in several directions: the first concerns the generality 
of the previous findings relating Gibrat's Law violation to a diversification effect. In Sec. |^ only 
one industry has been analyzed, but the observed trend in Fig. |^ is so similar to other results 
in literature ( [Stanley et al. (19"97D , [Lee et al. (1998] )) that a major multi-sectorial investigation, 
where possible, is advisable. Second, concerning the proposed model of firm diversification, 
it would be important to empirically investigate the nature of the "competencies" leading to 
penetration in new markets and their connection with the different characteristics of the firm, 
the more interesting probably being its "technological advantage". This investigation would 
constitute an important empirical support to the construction of evolutionary models of the 
firm (in line with what has been done, for instance, in Dosi et al. (1995 )). Finally, it would be 
interesting to apply the same description to diversification data from other industries. This 
would be a test of the model previously described, but, more interestingly, could constitute, 
via the introduction of industry-specific parameters, a "dynamical" way of characterizing 
diversification pattern in different sectors. 
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^•^Note that in the model the cumulative effect on "competencies" is described by a linear function. In 
general it would be probably possible to obtain a better agreement using more sophisticated relations and 
processes, for instance dropping the strictly binary nature of the branching or introducing the possibility of 
a branch "death" . This "finer" modeling would require, however, a higher "phenomenological" justification 
from the data. 
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APPENDIX 



A Solution of the Yule process 

To simplify the solution of (||) let me introduce a rescaled size t = X{g — go) and consider, 
instead of the probabilities in (H), the variables Unit) defined by 

= e-"*i/„(t) . (15) 

In these new variables the set of equation in (||) reduces to: 

Vnit) = {n - 1) yn-i{t) n > uq , . 

y'n^it) = n = no. ^'^^ 

with initial conditions: 

l/n(0) = ( n' " r° (17) 
From (|16D, iterating over the index n, it is immediate to write the solution as multiple integral: 

rt rtl ptn-nQ + l 

Vnit) = {n-l){n-2)...no dh dh . . . rft„_„„e*i+*^+-+*"-"o (18) 

Jo Jo Jo 

Notice that, due to the complete symmetry of the integrand, the multiple integral over the 
n — no- dimensional hyper-cube of side t with the constraints t„_no < . . . < ^2 < < ^ reduces 
to the integral over the whole hypercube divided by all the possible ordering of the constraints, 
which are (n — no)!. One thus obtains: 

l/n(t)= f''"M(e*-l)"-"°, (19) 
\n - no J 

that, remembering the factorization in (|l^) and substituting t with its previous definition, 
reduces to ([ToD after obvious algebra. 
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